SPECIFICATION

1. Title of the Invention:

Signal separation method, signal processing apparatus, image
processing apparatus, medical image processing apparatus and
storage medium for restoring multidimensional signals from
observed data in which multiple signals are mixed

2. Detailed Description of the Invention:
[Field of the Invention]

The present invention relates to a signal separation method
for separating specific multidimensional signals from
observed data, and more particularly to a signal separation
method for stably estimating and separating specific
multidimensional signals when multiple signals have been
observed in a mixed state.

[Background Art] 

As in the example of observing multiple frames of a moving
picture in which text has been overwritten on a background
image, there are situations where multiple multidimensional
signals that are originally independent are mixed with one
another and observed under plural conditions. A technique
for determining the signals prior to mixture from the
observed signals alone, in which multiple multidimensional
signals have been mutually mixed, is expected to be used in
various fields in the future. These fields include, for
example, separation of character information from image
information when acquiring images from a scanner or the
like, restoration of medical images for extracting necessary
information, enhancement and restoration of blurred images,
noise reduction by image reconstruction using
feature-extracting components of observed images corrupted
by noise, and classification and recognition of face data
and multidimensional data using feature-extracting
components of observed multidimensional signals such as
images.

Fig. 1 depicts a conceptual diagram of the signal separation
problem, in which original signals are separated assuming
only their mutual statistical independence when multiple
signals are observed in a mixed state. The situation of
Fig. 1 is formulated as follows. First, assume that for each
index t there are m scalar-valued signals s_1(t), ...,
s_m(t), which are mutually statistically independent and
have zero mean. Each signal s is obtained by appropriately
transforming a multidimensional matrix into a vector, for
example by arranging a two-dimensional signal into a single
vector in raster scan order. The arithmetic expressions may,
however, be modified equivalently so that the observed
signals are handled as matrices as they are.



Fig. 1 shows two signal sources s_1 and s_2 by way of
example. From these sources, n linear weighted sums x_1(t),
..., x_n(t) are observed by an observation apparatus, which
is expressed as follows.
[Equation 1]

    \underline{x}(t) = \underline{A}\,\underline{s}(t)

where each element is represented as follows.
[Equation 2]

    \underline{x}(t) = [x_1(t)\; x_2(t)\; \cdots\; x_n(t)]^T
    \underline{s}(t) = [s_1(t)\; s_2(t)\; \cdots\; s_m(t)]^T

where it is assumed that n ≥ m. Furthermore, the n × m
mixing matrix A is assumed to be a full rank matrix, i.e., a
matrix for which the inverse of the m × m matrix A^H A
exists. Hereinafter, a lowercase letter with an underline
represents a vector, an uppercase letter with an underline
represents a matrix, a superscript T represents
transposition, and a superscript H represents the Hermitian
conjugate (i.e., conjugate transposition).
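
As an illustrative sketch only, and not part of the original
disclosure, the mixing model of Equations 1 and 2 can be
simulated in Python with NumPy. The function name mix_signals,
the example mixing matrix, and the random stand-in images are
assumptions introduced for illustration.

```python
import numpy as np

def mix_signals(images, A):
    """Raster-scan each 2-D source into a row, then mix: x(t) = A s(t)."""
    # Each source image becomes one row of S (m x T); t indexes pixels.
    S = np.stack([img.reshape(-1) for img in images]).astype(float)
    # Subtract each source's mean so the zero-mean assumption holds.
    S -= S.mean(axis=1, keepdims=True)
    # Every column of X is one observed vector x(t) with n elements.
    return A @ S, S

rng = np.random.default_rng(0)
# Two hypothetical 4x5 "images" stand in for the sources s_1 and s_2 (m = 2).
sources = [rng.uniform(size=(4, 5)), rng.laplace(size=(4, 5))]
# Hypothetical full-rank 3x2 mixing matrix, so n = 3 >= m = 2.
A = np.array([[1.0, 0.5],
              [0.4, 1.0],
              [0.7, 0.7]])
X, S = mix_signals(sources, A)
```

Here each column of X plays the role of one observed vector
x(t); the separation problem is to recover S from X without
knowing A.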

The problem of estimating a separation matrix for obtaining
a separated signal y(t) from this observed signal x(t) is
the so-called signal separation problem. That is, when a
signal separation apparatus obtains a separated signal
y(t) = W^H x(t) from an observed signal x(t) observed by the
observation apparatus shown in Fig. 1, the estimation of the
separation matrix W is the problem to be solved.

Next, the concept behind estimating the separation matrix W
is summarized.

Assume that the multivariate probability density function of
a signal vector u serving as the observed signal x(t) is
p_u(u), and that the probability density function of each
element of the vector is p_i(u_i). The mutual information of
the observed vector is then represented by the following
Kullback-Leibler divergence,
[Equation 3]

    I(\underline{u}) = \int p_u(\underline{u})
        \log \frac{p_u(\underline{u})}{\prod_i p_i(u_i)} \, d\underline{u}

where the mutual information is always non-negative, and a
value of zero shows that the elements of the signal vector
are independent. In fact, if the signal vector elements are
independent of each other, the density function of the
signal vector is represented by the following equation, so
that the above equation becomes zero.
[Equation 4]

    p_u(\underline{u}) = \prod_i p_i(u_i)

Therefore, one of the rationales of signal separation
techniques is that the original signals can be restored from
the mixed observed signals by finding a transformation
matrix that minimizes the mutual information of the signal
vectors obtained from the observed signal vectors.



However, as the probability distribution of the original
signals is unknown in practice, the mutual information
cannot be minimized directly. Therefore, signal separation
is often performed by optimizing an evaluation quantity that
is equal or approximately equal to the mutual information.
For example, literature 1 ("International Journal of Neural
Systems", vol. 8, Nos. 5 & 6, pp. 661-678, October/December
1997) describes that the mutual information can be minimized
by finding a transformation matrix W that optimizes the sum
of the fourth-order cumulants with zero time delay of the
original signals (i.e., maximizing the sum if the kurtosis
is positive or minimizing it if the kurtosis is negative),
on the condition that the observed signals have kurtosis of
the same sign, the covariance matrix is bounded, whitening
has been performed, and the separation matrix is a unitary
matrix (i.e., W^H W = I, the unit matrix). Note that the
kurtosis refers to the value obtained by the following
calculation for an observed signal u_i.
[Equation 5]

    E\{u_i^4\} - 3\,[E\{u_i^2\}]^2

where E{·} represents the expectation operation. Whitening
means making the elements of the signal vector mutually
uncorrelated with unit variance, and the fourth-order
cumulant is the statistic represented by the following
equation.
[Equation 6]

    c_4(k_1, k_2, k_3) = E\{u_i(t)\,u_i(t+k_1)\,u_i(t+k_2)\,u_i(t+k_3)\}
        - E\{u_i(t)\,u_i(t+k_1)\}\,E\{u_i(t+k_2)\,u_i(t+k_3)\}
        - E\{u_i(t)\,u_i(t+k_2)\}\,E\{u_i(t+k_1)\,u_i(t+k_3)\}
        - E\{u_i(t)\,u_i(t+k_3)\}\,E\{u_i(t+k_1)\,u_i(t+k_2)\}

The zero time delay means that k_1, k_2 and k_3 are zero in
the above equation.
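
The relation between Equations 5 and 6 can be checked
numerically. The following Python sketch, which is an
illustration and not part of the original disclosure,
estimates both quantities from samples and compares them at
zero time delay. The function names kurtosis and cumulant4,
the replacement of expectations by sample averages, and the
restriction to non-negative integer lags are assumptions.

```python
import numpy as np

def kurtosis(u):
    """Sample estimate of Equation 5: E{u^4} - 3 [E{u^2}]^2 (zero-mean u)."""
    u = np.asarray(u, dtype=float)
    u = u - u.mean()
    return np.mean(u**4) - 3.0 * np.mean(u**2) ** 2

def cumulant4(u, k1, k2, k3):
    """Sample estimate of Equation 6 for non-negative integer lags."""
    u = np.asarray(u, dtype=float)
    u = u - u.mean()
    n = len(u) - max(k1, k2, k3)  # number of usable time indices t
    a, b, c, d = u[:n], u[k1:k1 + n], u[k2:k2 + n], u[k3:k3 + n]
    E = np.mean
    return (E(a * b * c * d)
            - E(a * b) * E(c * d)
            - E(a * c) * E(b * d)
            - E(a * d) * E(b * c))

rng = np.random.default_rng(0)
u_super = rng.laplace(size=5000)        # heavy-tailed: positive kurtosis
u_sub = rng.uniform(-1, 1, size=20000)  # flat: negative kurtosis
```

With all three lags set to zero, the three product terms of
Equation 6 coincide, so cumulant4(u, 0, 0, 0) reduces to
kurtosis(u).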

However, as the computational load of calculating high-order
statistics such as cumulants is generally heavy, techniques
are employed such as calculating and approximating another
information quantity equivalent to the mutual information,
or minimizing a cost function that is equivalent to
optimizing the sum of cumulants by introducing a nonlinear
function that can approximate the fourth-order cumulants.
U.S. Patent No. 5706402 discloses a method for finding a
separation matrix by the gradient method using an
unsupervised learning algorithm that optimizes output
entropy instead of minimizing mutual information.

Though literature 2 (Signal Processing, vol. 24, No. 1, pp.
1-10, July 1991) does not explicitly refer to mutual
information or cumulants, it discloses a similar approach,
in which the square of the residual that results from
subtracting a linear sum of estimated signals from the
observed signal is used as a cost function, and a separation
filter that minimizes the cost function is found by the
gradient method. Moreover, Japanese Unexamined Patent
Publication No. 2000-97758 discloses a method for improving
the convergence by normalizing the update amounts of the
above method.

Literature 3 (IEEE Transactions on Signal Processing, vol.
44, No. 12, pp. 3017-3030, December 1996) proposes an
estimation method in which a nonlinear function that
approximates the fourth-order cumulants is introduced, and
the update amounts for optimizing the cost function in an
adaptive algorithm based on that nonlinear function are
determined from the relative gradient. This technique
improves the convergence speed of the conventional adaptive
algorithm, which uses the gradient of the cost function as
the update amount, and is equivalent to the natural gradient
that can be derived from information-geometric
considerations.

Stability in the convergence process of the separation
matrix is important when restoring signals that are not in a
steady state. In fact, in the series of gradient methods
described above, there is often a trade-off between
convergence speed and stability. Thus, in order to achieve
stable convergence, U.S. Patent No. 5999956 uses a method
that adds, in addition to a signal estimation module and a
separation coefficient estimation module, a module for
reducing the effect on the estimation process and outputting
stable results even when there is a large change of power
between estimated signals.

Furthermore, literature 4 (International Journal of Neural
Systems, vol. 8, Nos. 5 & 6, pp. 601-612, October/December
1997) derives an adaptive algorithm based on the least
squares method instead of the gradient method for optimizing
a cost function into which a nonlinear function has been
introduced. With this approach, the step size is not chosen
by the user as in the gradient method but is determined
automatically to be optimal, so that the convergence speed
is enhanced and stability is achieved under given
conditions.

[Problems to be Solved by the Invention]
Like the technique of literature 4 above, methods within the
framework of the least squares method have been considered
to achieve fast and appropriate convergence in many cases,
since the step size is calculated to be optimal under the
cost function. However, the form of the cost function
employed by the above prior art techniques, including the
gradient method, does not necessarily match the situation in
which signal separation is required, so that there are cases
where the result does not seem to be the best even when the
framework of the least squares method is used.

For example, in the case of moving pictures, images change
little between adjacent frames. It can therefore easily be
assumed that an original signal is mixed into adjacent
frames at a similar ratio, so that, when the mixing ratios
are represented as matrix elements, the components of each
column (or each row) have about the same value. In such a
case, the condition number of the mixing matrix becomes
large, and a perturbation in the estimation process of the
separation matrix has a large effect on the estimates. Note
that the condition number refers to the quantity defined by
||Z|| · ||Z^{-1}|| using some norm || · || for a matrix Z,
where Z^{-1} represents the inverse matrix of the matrix Z.
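
A small numerical illustration, offered as a sketch and not
taken from the original disclosure, shows how nearly equal
mixing ratios in adjacent frames inflate the condition
number. The two 2 × 2 matrices below are hypothetical
examples, and the 2-norm is one possible choice of norm.

```python
import numpy as np

# Hypothetical mixing matrix for two adjacent frames: the rows are
# almost identical because the frames mix the sources at similar ratios.
A_similar = np.array([[1.00, 0.50],
                      [0.98, 0.52]])
# Hypothetical mixing matrix with clearly different ratios per frame.
A_distinct = np.array([[1.0, 0.5],
                       [0.2, 1.0]])

def condition_number(Z):
    """||Z|| * ||Z^-1|| using the matrix 2-norm."""
    return np.linalg.norm(Z, 2) * np.linalg.norm(np.linalg.inv(Z), 2)

cond_similar = condition_number(A_similar)
cond_distinct = condition_number(A_distinct)
```

For these example values the nearly collinear matrix has a
condition number more than ten times larger than the
well-separated one, which is the situation the text
identifies as sensitive to perturbation.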

Therefore, with the conventional form of the cost function,
much time is spent before normal estimates are obtained when
the perturbation is large, which is likely to be a problem.
A further problem is that, when the condition number is not
large, the convergence speed becomes slower than with the
conventional cost function in the stage where errors still
remain in the estimation process.

The present invention is proposed in view of the above
technical problems. Its object is to stably extract (and
separate) original signals from multiple signals in a small
number of calculation steps when multiple signals including
multidimensional signals have been observed in a mixed
state.

It is another object of the invention to provide a least
squares type method whose convergence speed is faster, based
on a cost function that reflects the change of perturbation
by proper weighting, when the perturbation on the estimates
is large in signal separation for observed signals in which
multiple signals including multidimensional signals are
observed in a mixed state.

[Summary of the Invention] 

In order to achieve the above objects, the present invention
introduces, as a cost function, a function having a
monotonically increasing characteristic such as an
exponential type function, which has an effect equivalent to
minimizing the mutual information of the observed signals,
and applies an adaptive algorithm that minimizes (optimizes)
that cost function in terms of a signal separation matrix.
As this optimization of the exponential type function yields
results equivalent to the optimization of H-infinity norms
and to the optimization by the solution of a two-person
zero-sum game in game theory, the so-called MinMax strategy,
it is also possible to estimate the signal separation matrix
by an adaptive algorithm derived from H-infinity norms or
the MinMax strategy. That is, the present invention provides
a signal separation method for restoring original
multidimensional signals from observed data where multiple
multidimensional signals are mixed, the method comprising
the steps of: selecting a specific separation matrix by
optimizing at least one of an H-infinity norm type cost
function and a MinMax strategy type cost function for the
observed data; and multiplying the observed data by the
selected separation matrix to restore the original
multidimensional signals. These original multidimensional
signals refer to, for example, original signals that are
multidimensional, such as original image signals.

In another aspect of the present invention, there is
provided a signal separation method for estimating and
restoring original multidimensional signals from observed
data where multiple signals including the original
multidimensional signals are mixed, the method comprising
the steps of: introducing a cost function for the observed
data, the cost function being based on a function that has a
monotonically increasing characteristic; estimating a
separation matrix using an adaptive filter that optimizes
the introduced cost function; and multiplying the observed
data by the estimated separation matrix to estimate and
restore the original multidimensional signals.

The step of estimating a separation matrix comprises using
an adaptive filter that minimizes the cost function in terms
of the separation matrix, wherein the introduced cost
function is an exponential type function. This reduces the
time needed to obtain normal estimates even when the
perturbation on the estimates is large in the estimation
process of the separation matrix.

The multiple signals included in the observed data may
comprise other multidimensional signals or one-dimensional
signals in addition to the original multidimensional
signals.

In another aspect of the invention, there is provided a
method for separating and extracting original signals from
observed signals where multiple multidimensional signals
including the original signals are mixed, the method
comprising the steps of: reading the observed signals;
transforming the data structure of the read observed
signals; subtracting the average from the observed signals
whose data structure has been transformed, to perform zero
averaging; performing whitening on the observed signals that
have undergone zero averaging; performing separation
processing on the whitened observed signals based on a cost
function having a monotonically increasing characteristic;
and performing inverse whitening as post-processing on the
observed signals that have undergone the separation
processing. The nonlinear function used in the cost function
in the separation processing can be changed based on the
kurtosis of the read observed signals.

In a further aspect of the invention, there is provided a
signal processing apparatus, comprising: means for inputting
observed data where multiple signals including original
multidimensional signals are mixed; means for selecting a
specific separation matrix by optimizing at least one of an
H-infinity norm type cost function and a MinMax strategy
type cost function for the input observed data; and means
for multiplying the observed data by the selected separation
matrix to estimate and restore the original multidimensional
signals.

In a further aspect of the invention, there is provided a
signal processing apparatus, comprising: means for inputting
observed data where multiple multidimensional signals that
are originally independent are mutually mixed; means for
estimating a separation matrix by using an adaptive filter
for the input observed data, the filter optimizing a cost
function based on a function that has a monotonically
increasing characteristic; and means for multiplying the
observed data by the estimated separation matrix to estimate
and restore the specific multidimensional signals.

The observed data input by the input means comprises image
data consisting of mixtures of moving picture data regarded
as three-dimensional data, whereby multiple multidimensional
signals containing moving pictures that are originally
independent are separated stably.

In a further aspect of the invention, there is provided an
image processing apparatus, comprising: means for inputting
mixed image data where multiple image signals are mixed;
means for estimating a separation matrix by using an
adaptive filter for the input mixed image data, the filter
optimizing a cost function (i.e., minimizing the cost
function in terms of the separation matrix) based on a
function that has a monotonically increasing characteristic,
as typified by an exponential type function; and means for
multiplying the mixed image data by the estimated separation
matrix to separate and extract the image signals from the
mixed image data.

The input mixed image data comprises text overwritten on a
background image and is observed as multiple frames, wherein
the image data prior to mixture can be determined even when
multiple image data that are originally independent are
mutually mixed in multiple states.

The image signals that are separated and extracted are image
signals used for face recognition and identification in
biometrics, or image signals obtained by observation and
measurement by satellites; the present invention can be used
effectively for an image processing apparatus applied in
these fields.

In a further aspect of the invention, there is provided a
medical image processing apparatus, comprising: means for
inputting medical image data in which a signal change whose
interference process is unknown is contained in observed
bio-image signals such as fMRI (functional magnetic
resonance imaging) signals; means for estimating a
separation matrix by using an adaptive filter for the input
medical image data, the filter optimizing a cost function
based on a function having a monotonically increasing
characteristic; and means for multiplying the medical image
data by the estimated separation matrix to separate and
extract the observed bio-image signals from the medical
image data.

In a further aspect of the invention, there is provided a
storage medium tangibly embodying a program readable and
executable by a computer, the program comprising: processing
for introducing a cost function for observed data where
multiple multidimensional signals are mixed, the cost
function being based on a function that has a monotonically
increasing characteristic; processing for estimating a
separation matrix using an adaptive filter that optimizes
the introduced cost function; and processing for multiplying
the observed data by the estimated separation matrix to
estimate and restore the specific multidimensional signals.
Also, the program comprises: processing for selecting a
specific separation matrix by optimizing at least one of an
H-infinity norm type cost function and a MinMax strategy
type cost function for observed data where multiple
multidimensional signals are mixed; and processing for
multiplying the observed data by the selected separation
matrix to estimate and restore the specific multidimensional
signals.

These storage media include CD-ROM media, for example,
wherein the program is read by a CD-ROM reader in a computer
system and stored on a hard disk drive in the computer
system to be executed.

[Preferred Embodiments]

Now the present invention will be described in detail with
reference to the accompanying drawings.
A signal separation method according to the invention is
applied to extracting accurate changes of brain activity
when restoring medical images such as fMRI (functional
magnetic resonance imaging) images to extract necessary
information. It is also applied to the extraction of an
original image prior to blurring by enhancing and restoring
a blurred image, and to noise reduction by means of image
reconstruction using feature-extracting components of an
observed image corrupted by noise. Furthermore, it is
conceivable that the present invention will be applied to
the classification and recognition of face data or
multidimensional data by means of feature-extracting
components of observed multidimensional signals such as
images, including face recognition in biometrics and
geographic analysis from satellites. That is, a signal
separation method according to the present invention is
widely applicable to problems that require original signals
to be separated from signals in which a general
multidimensional signal sequence is observed in a mixed
state, irrespective of whether the observed signals are
digital or analog, complex or real, and so on; a wide range
of derivative applications is possible, and the invention is
not limited to the above examples. Note that a signal
separation technique according to the present invention can
be used in a dedicated apparatus or a terminal, as well as
in a computer system having a variety of memories,
controllers, a display, and the like.



Next, a signal separation method according to the present
invention will be described in detail. First, by introducing
a nonlinear function that can approximate a fourth-order
cumulant into a cost function, so that its minimization has
a meaning equivalent to the minimization of mutual
information, a scheme for estimating the separation matrix
represented as follows is considered.
[Equation 7]

    \hat{\underline{W}}    (separation matrix)
Conventionally, the cost function used in such a scheme has
the following form,
[Equation 8]

    E\{ \| \underline{x} - \underline{W}\,\underline{f}(\underline{W}^H \underline{x}) \|^2 \}

where f(·) is a vector whose elements are a nonlinear
function such as tanh(·).



According to the present invention, a signal separation
method is proposed that minimizes the following type of cost
function in terms of the above separation matrix. That is,
[Equation 9]

    J_1(\underline{W}) = \gamma^2 \ln E\{ \exp( \gamma^{-2}
        \| \underline{x} - \underline{W}\,\underline{f}(\underline{W}^H \underline{x}) \|^2 ) \}

Here, it is assumed that the difference between the true
initial value of the separation matrix and its estimated
initial value, and the noise generated in the estimation
process, are zero mean and uncorrelated, and that the
estimation noise is a white random variable having unit
variance. A white random variable means that v_i and v_j are
independent stochastic variables for any i and j (i ≠ j) in
a variable series {v_k}. In the description below, an index
(t) may be used instead of the hat symbol that indicates an
estimate of the separation matrix, as long as there is no
need to distinguish between a true value and an estimated
value.

Since the logarithmic function ln(·) is a monotonically
increasing function, minimizing the above cost function is
equivalent to minimizing the following expression.
[Equation 10]

    E\{ \exp( \gamma^{-2}
        \| \underline{x} - \underline{W}\,\underline{f}(\underline{W}^H \underline{x}) \|^2 ) \}
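
A short worked expansion, offered as a sketch and not as
part of the original text, may clarify how the exponential
type cost relates to the conventional squared-error cost of
Equation 8. The symbol r is introduced here as shorthand for
the squared residual, and it is assumed that the moments of
r exist and that γ is large. The cumulant expansion of the
logarithm of the moment generating function then gives:

```latex
r = \| \underline{x} - \underline{W}\,\underline{f}(\underline{W}^H \underline{x}) \|^2 ,
\qquad
\gamma^2 \ln E\{ \exp(\gamma^{-2} r) \}
  = E\{ r \} + \frac{1}{2\gamma^2}\,\mathrm{Var}\{ r \} + O(\gamma^{-4})
```

Under these assumptions the exponential type cost approaches
the conventional cost E{r} as γ grows, while for finite γ it
additionally penalizes the variance of the residual, that
is, it weights large residuals more heavily.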
In addition, though various proposals have been made as to
what kind of nonlinear function should be used, it is common
to use threshold processing that changes the function
depending on whether the kurtosis of the observed signals is
positive or negative, thereby selecting a function that
enables an appropriate approximation of high-order
cumulants. Various other forms are also conceivable.
Functions to be used include tanh(u), u^3, and u - tanh(u).
For example, u - tanh(u) is used when the kurtosis is
positive, while tanh(u) is used when the kurtosis is
negative.
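
The threshold processing described above can be sketched in
Python as follows. This is an illustrative sketch only: the
function name select_nonlinearity is hypothetical, the
kurtosis is estimated from sample averages, and a kurtosis
of exactly zero is arbitrarily treated like the negative
case.

```python
import numpy as np

def select_nonlinearity(x):
    """Pick f(u) from the sign of the sample kurtosis of x."""
    x = np.asarray(x, dtype=float)
    x = x - x.mean()
    kurt = np.mean(x**4) - 3.0 * np.mean(x**2) ** 2
    if kurt > 0:
        # Positive kurtosis: use u - tanh(u).
        return lambda u: u - np.tanh(u)
    # Negative (or zero) kurtosis: use tanh(u).
    return np.tanh

# Flat, two-valued data has negative kurtosis.
f_sub = select_nonlinearity([-1.0, 1.0, -1.0, 1.0])
# Mostly-zero data with two large spikes has positive kurtosis.
f_super = select_nonlinearity([0.0] * 98 + [10.0, -10.0])
```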


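Fig. 2 depicts a block diagram showing a first algorithm for
estimating a separation matrix that minimizes the cost
function. The algorithm shown in Fig. 2 is represented as
follows.
[Equation 11]

    \underline{y}(t) = \underline{f}(\underline{W}^H(t-1)\,\underline{x}(t))
    \underline{h}(t) = \underline{P}(t-1)\,\underline{y}(t)
    \underline{g}(t) = \underline{h}(t) / \{\beta + \underline{y}^H(t)\,\underline{h}(t)\}
    \alpha = 1 - \gamma^{-2}
    \zeta = \{\beta + \underline{y}^H(t)\,\underline{h}(t)\} / \{\beta + \alpha\,\underline{y}^H(t)\,\underline{h}(t)\}
    \underline{P}(t) = \frac{1}{\beta}\{\underline{P}(t-1) - \alpha\,\zeta\,\underline{g}(t)\,\underline{h}^H(t)\}
    \underline{e}(t) = \underline{x}(t) - \underline{W}(t-1)\,\underline{y}(t)
    \underline{W}(t) = \underline{W}(t-1) + \underline{e}(t)\,\underline{g}^H(t)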

Fig. 2 depicts the flow of the main algorithm, consisting of
a nonlinear function 21, calculation of an error signal e(t)
22, update of W(t) 23, and a unitarization operation 24,
wherein the various quantities necessary for this main
algorithm are calculated by units including calculation of
h(t) 25, calculation of g(t) and ζ 26, and calculation of
P(t) 27. z^{-1} is a delay for timing adjustment. As shown
in Fig. 2 and the above algorithm, first an observed signal
x(t) is input, and then y(t) is calculated by the nonlinear
function 21, which is determined depending on whether the
kurtosis is positive or negative. The nonlinear function 21
operates on the product of the estimated separation matrix
W(t-1), which was estimated at the previous time (i.e., the
previous cycle), and the observed signal x(t) at the present
time. Thereafter, y(t) is input to the calculation of the
error signal e(t) 22, where the difference is calculated
between the observed signal x(t) at that time and the
product of the estimated separation matrix W(t-1) at the
previous time and the calculated y(t). Next, the separation
matrix W(t) at that time is updated in block 23. In this
update of W(t) 23, W(t) is obtained by adding the product of
the error signal e(t) and g^H(t) to W(t-1) estimated at the
previous time. The unitarization operation 24 is a step for
ensuring that the matrix is unitary after the update of
W(t), and is preferably performed to improve the accuracy.
This is expressed as follows.
[Equation 12]

    \underline{W}(t) = \underline{W}(t)\,\{(\underline{W}^H(t)\,\underline{W}(t))^{-1}\}^{1/2}



In the calculation of h(t) 25, h(t) is calculated from
P(t-1) at the previous time and y(t). In the calculation of
g(t) and ζ 26, each value is calculated using a forgetting
factor β. This forgetting factor β is a constant satisfying
0 < β ≤ 1 and may usually be 1; however, when the matrix to
be estimated varies with time, β is set to a value smaller
than 1, such as 0.99 or 0.98, to enable tracking of this
variation. In the calculation of P(t) 27, P(t), which is the
inverse of the covariance matrix of y(t), is calculated from
h(t) calculated in the calculation of h(t) 25, the input α,
the forgetting factor β, and g(t) and ζ calculated in the
calculation of g(t) and ζ 26. Note that the initial values
P(0) and W(0) are arbitrarily selected for these algorithms.
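
One update cycle of the first algorithm (Equation 11
followed by the unitarization of Equation 12) can be
sketched in Python as below. This is a minimal sketch under
stated assumptions, not the definitive implementation: the
function names separation_step and unitarize, the default
values of beta and gamma, and the use of an eigenvalue
decomposition to form the inverse square root are choices
made here for illustration, and the sketch has only been
exercised on real-valued data.

```python
import numpy as np

def unitarize(W):
    """Equation 12: W {(W^H W)^-1}^(1/2), via eigh of the Hermitian W^H W."""
    lam, U = np.linalg.eigh(W.conj().T @ W)
    return W @ (U @ np.diag(lam ** -0.5) @ U.conj().T)

def separation_step(W, P, x, f=np.tanh, beta=1.0, gamma=2.0):
    """One cycle of Equation 11; returns the updated (W, P)."""
    y = f(W.conj().T @ x)                  # nonlinear function 21
    h = P @ y                              # calculation of h(t) 25
    yh = np.vdot(y, h)                     # y^H(t) h(t)
    g = h / (beta + yh)                    # calculation of g(t) 26
    alpha = 1.0 - gamma ** -2
    zeta = (beta + yh) / (beta + alpha * yh)
    P_new = (P - alpha * zeta * np.outer(g, h.conj())) / beta  # P(t) 27
    e = x - W @ y                          # error signal e(t) 22
    W_new = W + np.outer(e, g.conj())      # update of W(t) 23
    return unitarize(W_new), P_new         # unitarization operation 24

# Hypothetical starting values: P(0) and W(0) may be chosen arbitrarily.
rng = np.random.default_rng(0)
W0, P0 = np.eye(3), np.eye(3)
x_t = rng.normal(size=3)                   # stand-in for a whitened x(t)
W1, P1 = separation_step(W0, P0, x_t, beta=0.99, gamma=2.0)
```

In use, separation_step would be called once per observed
vector x(t), feeding the returned W and P back in at the
next time index.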

It is assumed that, before the above estimation processing
of the separation matrix, the observed signal x(t) undergoes
preprocessing that transforms it into a zero-mean signal and
performs whitening. This preprocessing is performed on the
observed signal x(t) and needs no information about the
original signals. Whitening means that the elements of the
observed signal vector are made mutually uncorrelated with
unit variance; the technique is not restricted to any
particular one, and eigenvalue decomposition or principal
component analysis may be used, for example.
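
The zero-mean and whitening preprocessing can be sketched
with an eigenvalue decomposition, one of the options named
above. The function name whiten and the symmetric form of
the whitening matrix V are assumptions made for this sketch,
which also assumes that the sample covariance matrix has
full rank.

```python
import numpy as np

def whiten(X):
    """Zero-mean and whiten the columns x(t) of X (n x T).

    Returns the whitened data Z, the whitening matrix V and the mean,
    so that the operation can later be undone.
    """
    mean = X.mean(axis=1, keepdims=True)
    Xc = X - mean                                   # zero averaging
    cov = Xc @ Xc.T / Xc.shape[1]                   # sample covariance
    eigval, eigvec = np.linalg.eigh(cov)            # eigenvalue decomposition
    V = eigvec @ np.diag(eigval ** -0.5) @ eigvec.T
    return V @ Xc, V, mean

# Hypothetical mixed observations with a non-zero mean.
rng = np.random.default_rng(1)
A = np.array([[2.0, 0.5, 0.1],
              [0.3, 1.0, 0.4],
              [0.2, 0.1, 1.5]])
X = A @ rng.laplace(size=(3, 2000)) + 5.0
Z, V, mean = whiten(X)
```

After this step the sample covariance of Z is the identity
matrix, and multiplying by the inverse of V and adding the
mean back recovers X.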

In this way, the signal separation method according to the
present invention is based on optimizing a cost function
J_1, which is based on an exponential function, in terms of
a separation matrix. From this point of view, a similar
result can therefore be obtained by using other algorithms.
Fig. 3 and Fig. 4 depict the configuration of a second
algorithm. Fig. 3 depicts the overall configuration of the
second algorithm, while Fig. 4 depicts the estimation
filtering of w_i(t). The second algorithm is obtained by
transforming the first algorithm by deflation. In this
second algorithm, optimization is performed by estimating
the column vectors one by one so as to reduce the order in
turn, as shown in Fig. 3. For example, estimation filtering
of w_1(t) is performed on the observed signal x_1(t) in
block 31, then estimation filtering of w_2(t) is performed
on the observed signal x_2(t) in block 32, and in the same
manner, estimation filtering of w_m(t) is performed on the
observed signal x_m(t) in block 33, thereby finally deriving
the estimated separation matrix W(t). That is, in this
second algorithm, the operation shown in Fig. 4 is repeated
for i = 1, ..., m, starting from x_1(t) = x(t).

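The algorithm shown in Fig. 4 is represented as follows.
[Equation 13]

    y_i(t) = f(\underline{w}_i^H(t-1)\,\underline{x}_i(t))
    \alpha = 1 - \gamma^{-2}
    \underline{e}_i(t) = \underline{x}_i(t) - \underline{w}_i(t-1)\,y_i(t)
    \zeta = \frac{\beta\,d_i(t-1) + y_i(t)\,y_i^*(t)}{\beta\,d_i(t-1) + \alpha\,y_i(t)\,y_i^*(t)}
    \frac{1}{d_i(t)} = \frac{1}{\beta}\left\{ \frac{1}{d_i(t-1)}
        - \alpha\,\zeta\,\frac{y_i(t)\,y_i^*(t)}{d_i(t-1)\,\{\beta\,d_i(t-1) + y_i(t)\,y_i^*(t)\}} \right\}
    \underline{w}_i(t) = \underline{w}_i(t-1)
        + \underline{e}_i(t)\,[\,y_i^*(t) / \{\beta\,d_i(t-1) + y_i(t)\,y_i^*(t)\}\,]
    \underline{x}_{i+1}(t) = \underline{x}_i(t) - \underline{w}_i(t)\,y_i(t)

where d_i(0) is any initial value and a superscript *
represents the complex conjugate. f(·) represents a
nonlinear function such as tanh(·).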
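
The deflation recursion of Equation 13 can be sketched in
Python as follows. This is a sketch under stated assumptions
rather than the definitive procedure: the function name
deflation_step and the default parameter values are
hypothetical, real-valued data is assumed, and the optional
unitarization (for example Gram-Schmidt) is omitted.

```python
import numpy as np

def deflation_step(W, d, x, f=np.tanh, beta=1.0, gamma=2.0):
    """One time step of the second algorithm for all columns w_i.

    W: n x m matrix whose columns are w_i(t-1); d: length-m array d_i(t-1).
    Returns updated copies (W, d); the inputs are left unchanged.
    """
    alpha = 1.0 - gamma ** -2
    W = np.array(W, dtype=float)
    d = np.array(d, dtype=float)
    x_i = np.array(x, dtype=float)          # x_1(t) = x(t)
    for i in range(W.shape[1]):
        w = W[:, i].copy()
        y = f(w @ x_i)                      # nonlinear function 41
        e = x_i - w * y                     # error signal e_i(t) 42
        yy = y * y                          # y_i(t) y_i^*(t) for real data
        s = beta * d[i] + yy
        zeta = s / (beta * d[i] + alpha * yy)            # calculation 46
        W[:, i] = w + e * (y / s)                        # update of w_i(t) 43
        inv_d = (1.0 / d[i] - alpha * zeta * yy / (d[i] * s)) / beta
        d[i] = 1.0 / inv_d                               # calculation 47
        x_i = x_i - W[:, i] * y             # update of x_{i+1}(t) 45
    return W, d

# Hypothetical starting values for n = 3 observations and m = 2 columns.
W0 = np.eye(3)[:, :2]
d0 = np.ones(2)
x_t = np.array([0.5, -1.0, 2.0])
W1, d1 = deflation_step(W0, d0, x_t, beta=0.99, gamma=2.0)
```

Algebraically, the update of 1/d_i(t) in Equation 13
simplifies to d_i(t) = β d_i(t-1) + α y_i(t) y_i^*(t), which
the sketch reproduces numerically.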

In Fig. 4, the flow of the main algorithm is represented by
a nonlinear function 41, calculation of an error signal
e_i(t) 42, update of w_i(t) 43, and update of x_{i+1}(t) 45,
wherein the various quantities necessary for this main
algorithm are calculated by units including calculation of ζ
46 and calculation of d_i(t) 47. z^{-1} is a delay for
timing adjustment. The accuracy is enhanced by performing a
unitarization operation when each w_i(t) is found or when
all w_i(t) are found. For example, in the case of real
vectors, the Gram-Schmidt orthogonalization process may be
applied; however, the method is not restricted to this.

U Fig. 5 depicts a flow of processing from data reading to 

data outputting according to the present invention. First, 
Q an observed signal x(t) that is composed of multidimensional 

3 EJS 

yp signals is read (step 101), then a transformation of data 

H 15 structure is performed, which transforms matrix information 
such as an image into a vector (step 102) . Thereafter, it 
is determined whether to use a predetermined function as a 
nonlinear function used in the algorithm or to change a 
function depending on the kurtosis (steps 103 to 105) . That 
20 is, it is determined whether a predetermined function is 
used or not (step 103), if so, the predetermined function is 
set (step 104), otherwise the function is set depending on 
the determination of kurtosis (step 105) . Next, zero 

averaging of the observed signal x(t) is performed, that is, 
25 an average of the observed signal x(t) is subtracted to make 
the average be zero (step 106), then whitening of the 
observed data is performed (step 107) . Next, from the 
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beginning of data to the end, or from the beginning to a 
predetermined data point, separation processing is 
recursively performed, which is based on an exponential 
function type of cost function according to the present 
5 invention (step 108), then the inverse whitening is 
performed as a post-processing (step 109), and finally the 
output result is obtained (step 110) . As the separation 
processing, the configuration based on the first algorithm 
shown in Fig. 2 or the configuration based on the second 
10 algorithm shown in Fig. 3 and Fig. 4 may be used. 
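
The pre- and post-processing of steps 106, 107 and 109 can be
sketched for the simplest case of two observed channels. The sketch
assumes real data and uses the closed-form square root of a 2x2
symmetric positive-definite covariance matrix; the function names
are hypothetical, and a general implementation would use an
eigenvalue decomposition instead.

```python
import math


def zero_mean(rows):
    """Step 106: subtract the mean of each channel (rows = list of channels)."""
    means = [sum(r) / len(r) for r in rows]
    centered = [[v - m for v in r] for r, m in zip(rows, means)]
    return centered, means


def whitening_matrix_2x2(rows):
    """Step 107: return (V, V_inv) with V = C^(-1/2) for two zero-mean channels.

    Uses sqrt(C) = (C + s I) / t, where s = sqrt(det C) and
    t = sqrt(trace C + 2 s), valid for a 2x2 positive-definite C.
    """
    n = len(rows[0])
    a = sum(v * v for v in rows[0]) / n
    c = sum(v * v for v in rows[1]) / n
    b = sum(u * v for u, v in zip(rows[0], rows[1])) / n
    s = math.sqrt(a * c - b * b)
    t = math.sqrt(a + c + 2.0 * s)
    p, q, r = (a + s) / t, b / t, (c + s) / t  # entries of sqrt(C)
    det = p * r - q * q
    V = [[r / det, -q / det], [-q / det, p / det]]
    V_inv = [[p, q], [q, r]]  # used for inverse whitening (step 109)
    return V, V_inv


def apply_2x2(M, rows):
    """Multiply a 2x2 matrix by a 2 x N data array."""
    return [[M[i][0] * u + M[i][1] * v for u, v in zip(rows[0], rows[1])]
            for i in range(2)]
```

After apply_2x2(V, rows) the two channels are uncorrelated with unit
variance, and apply_2x2(V_inv, ...) undoes the transformation after
the separation step.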



Next, the characteristics of the signal separation method
according to the present invention will be described. As
previously stated, the cost function that has been used
before has the following form based on the H2 norm.
[Equation 14]

J_2(W) = E\{ \| x - W f(W^H x) \|^2 \}

This type of cost function estimates a separation matrix
with an equal consideration weight both when some kind of
perturbation such as an estimation error is added to the
estimated vector (represented by the following equation) and
when convergence has been obtained.
[Equation 15]

\hat{x} = W f(W^H x)

On the other hand, according to the exponential function type
of cost function of the present invention, as the error
increases, the consideration weight involved in the cost
function increases, so that the convergence is accelerated,
while close to convergence, a stable estimation is
expected because unnecessary updates are not accelerated.
Fig. 6 depicts a conceptual diagram showing the degree of
consideration of an error in the cost function. The
horizontal axis shows the value of the error, while the
vertical axis shows the degree of consideration. The
approximately straight line in the drawing indicates the prior
scheme, while the curved line with a downward convex shape
indicates the scheme of the present invention. According to
the present invention, in the initial stage of estimation
where the estimation error is large, a large update is
performed with an optimal update amount, while in the stage
where the estimation has progressed, an appropriate update for
fine tuning is performed steadily, so that a high-precision
estimate is consequently obtained in a small number of steps.
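
The contrast drawn in Fig. 6 can be illustrated numerically. The
sketch below compares the derivative of a quadratic cost with that of
an exponential cost; the specific form exp(e^2 / (2 gamma^2)) and the
parameter gamma are assumptions chosen for illustration and are not
taken from the cost function defined in the specification.

```python
import math


def quadratic_weight(err):
    """Derivative of e^2 / 2: grows linearly with the error."""
    return err


def exponential_weight(err, gamma=1.0):
    """Derivative of gamma^2 * exp(e^2 / (2 gamma^2)).

    Close to the linear weight for small errors, much steeper for large ones.
    """
    return err * math.exp(err * err / (2.0 * gamma * gamma))
```

For a small error the two weights nearly coincide, so the update near
convergence is not inflated, whereas for a large error the
exponential weight is several times larger.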

As an example that greatly differs from the prior art, there
is a case where the difference between frames is small when
separating character information and background information
from adjacent frames in image sequences. In this case, the
components of each row or column of the mixing matrix A have
nearly the same value, so that the condition number of the
mixing matrix A becomes large. As the condition number
becomes large, a perturbation such as an estimation error has
a great influence, and thus the prior scheme is likely to
need extra steps for convergence even when a small
perturbation occurs. On the other hand, according to the
present scheme, the consideration weight in the cost function
becomes large when a large perturbation occurs, which has an
effect equivalent to enlarging the correction in the update
equation of the algorithm. As a result, the present scheme is
expected to accelerate the convergence. Of course, as is
evident from the above consideration, the convergence is also
accelerated compared with the prior scheme when the condition
number is not large, so that a high-precision separation
matrix is expected to be estimated more quickly. That is,
according to the present scheme, the update amount that tries
to suppress the estimation error as much as possible is
improved under a more appropriate cost function than the
prior one.
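
To make the remark about the condition number concrete, the following
sketch computes the 2-norm condition number of a real 2x2 mixing
matrix from its singular values. The function name and the example
matrices are hypothetical.

```python
import math


def condition_number_2x2(A):
    """2-norm condition number (sigma_max / sigma_min) of a real 2x2 matrix."""
    (a, b), (c, d) = A
    # Entries of the symmetric matrix A^T A.
    p = a * a + c * c
    q = a * b + c * d
    r = b * b + d * d
    # Largest eigenvalue of A^T A, i.e. sigma_max squared.
    lam_max = (p + r) / 2.0 + math.sqrt(((p - r) / 2.0) ** 2 + q * q)
    det = abs(a * d - b * c)  # equals sigma_max * sigma_min
    if det == 0.0:
        return math.inf
    return lam_max / det
```

A matrix whose rows are almost equal, such as [[1, 1], [1, 1.01]],
yields a condition number of about 400, while the identity matrix
yields 1.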

Moreover, as is evident from the above algorithm, the
present scheme has the structure of an adaptive filter, so
that even when the mixing matrix A is time-varying, such
that the mixing ratio changes in the middle of an image or
in the middle of an image sequence, multidimensional signal
separation that tracks the time variation is enabled.

In view of the above explanation, provided that an
exponential function type of cost function is used and the
calculation method optimizes the cost function J_1, the
present invention is not limited to the above-mentioned
form. A similar algorithm may be derived from a cost
function that is based on an approximate expression where the
exponential function is expressed as a polynomial expansion,
or from a cost function that is based on a function having a
similar monotonically increasing characteristic. Now, cost
functions will be described that have the same meaning as the
exponential function type of cost function. That is, by
deriving an estimation algorithm of a separation matrix
based on the two cost functions described below, an algorithm
similar to the ones shown in Fig. 2, Fig. 3 and Fig. 4 can
be derived.
[Equation 16]

J_{12}(W) = \min \max \Big[ E\{ \| \bar{W} f(\bar{W}^H x) - W f(W^H x) \|^2 \}
    - \gamma^2 \big\{ (\bar{W} - W_0)^H \Pi_0^{-1} (\bar{W} - W_0)
    + E\{ \| x - \bar{W} f(\bar{W}^H x) \|^2 \} \big\} \Big]

where \Pi_0 is a positive-definite matrix representing the degree
of uncertainty of the initial value W_0, and \bar{W} is the true
value of the separation matrix. Note that a positive-definite
matrix means a matrix M for which v^H M v is positive for all
non-zero vectors v.
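
The definition of positive definiteness given above can be checked
numerically. The sketch below handles the real symmetric case only
and relies on the fact that a symmetric matrix is positive definite
exactly when its Cholesky factorization exists; the function name is
hypothetical.

```python
import math


def is_positive_definite(M):
    """Return True if the real symmetric matrix M (list of lists) is positive definite."""
    n = len(M)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = M[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                if s <= 0.0:
                    return False  # a non-positive pivot: no Cholesky factor
                L[i][j] = math.sqrt(s)
            else:
                L[i][j] = s / L[j][j]
    return True
```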

J_11 is an H-infinity norm, and the form of this cost
function means that, no matter how large an estimation error
exists, a method can be derived which estimates a separation
matrix such that the upper bound of the H2 norm of the
estimation errors is less than or equal to a predetermined
scalar quantity. The cost function J_12 means that an
optimization method can be derived by a strategy using the
minimax theorem of game theory, wherein the separation matrix
that generates the minimum error is selected as the solution
from among the candidates for the maximal value of the errors
that arise for various separation matrices. What these cost
functions mean is consistent with the features of the signal
separation method derived from the exponential function type
of cost function described above.

Next, the results of an experiment applying the signal
separation method according to the present invention will be
described. Three kinds of synthetic signals were prepared
for a separation experiment. These synthetic signals are as
follows.

[Equation 17]

r_1 = \sin(2\pi \cdot 60\, t)

r_2 = \mathrm{sign}(\sin(2\pi \cdot 500\, t))

r_3 : random variables uniformly distributed between [0,1]

where the sampling frequency was 10 kHz, and sign(·) represents
a function that outputs the sign of its argument.
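
The three test signals of Equation 17 could be generated as follows.
This is a sketch under assumptions: the 500 Hz frequency of r_2 is
read from a partly garbled equation, the function name and seed are
hypothetical, and a sine value of exactly zero is mapped to +1.

```python
import math
import random


def synthetic_sources(n_samples, fs=10_000.0, seed=0):
    """Return (r1, r2, r3) sampled at fs Hz, each a list of n_samples floats."""
    rng = random.Random(seed)
    t = [k / fs for k in range(n_samples)]
    # r1: 60 Hz sinusoid
    r1 = [math.sin(2.0 * math.pi * 60.0 * tk) for tk in t]
    # r2: sign of a 500 Hz sinusoid (square wave), zero treated as +1
    r2 = [math.copysign(1.0, math.sin(2.0 * math.pi * 500.0 * tk)) for tk in t]
    # r3: uniform random values in [0, 1)
    r3 = [rng.random() for _ in t]
    return r1, r2, r3
```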

The methods of the aforementioned literature 3 and
literature 4 were used for comparison. The method of
literature 3 is based on a relative gradient, whose
convergence is said to be fast among gradient methods, while
the method of literature 4 is a conventional least squares
type of algorithm. These were compared against the algorithm
of the present invention shown in Fig. 2. The mixing matrix
was assumed to be generated from random variables that are
uniformly distributed between [0, 1], with a condition number
over 2000.

Fig. 7 depicts the convergence results obtained by conducting
ten independent trials of the separation experiment and
averaging them. The horizontal axis represents the number of
iterations, while the vertical axis represents an index value
of convergence. The convergence characteristics for the
synthetic signals are shown for the technique of literature 3,
the technique of literature 4, and the technique of the
present invention. As the characteristic index of
convergence, the following index used in literature 4 is
employed, which indicates that the precision of estimation
becomes higher as it approaches zero.

[Equation 18]

where p_ij represents the (i, j) element of the matrix that is
obtained as the product of the estimated separation matrix and
the mixing matrix A.

As the technique of literature 3 requires that a user
determine the update step-size, here the step-size that
achieves the fastest convergence is used among those that
achieve convergence at the same level of stability as the
present invention. However, as is evident from Fig. 7, the
convergence speed of the technique of literature 3 is slower
than that of the technique of the present invention. On the
other hand, although the step-size can be automatically set
to an optimal one according to both the technique of
literature 4 and the technique of the present invention, it
is evident that the technique of the present invention
achieves faster convergence than the technique
of literature 4.
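
An index of the kind described for Equation 18 can be sketched as
follows. The exact expression is not reproduced here; the code
assumes the widely used Amari-type cross-talk index, which is zero
exactly when the product matrix is a scaled permutation matrix. The
function name is hypothetical, and the matrix is assumed to have no
all-zero row or column.

```python
def crosstalk_index(P):
    """Amari-type index of a square matrix P (list of lists).

    P is the product of the estimated separation matrix and the mixing
    matrix; the index is 0 when each row and column has one non-zero entry.
    """
    n = len(P)
    absP = [[abs(v) for v in row] for row in P]
    total = 0.0
    for i in range(n):
        row = absP[i]
        total += sum(row) / max(row) - 1.0
    for j in range(n):
        col = [absP[i][j] for i in range(n)]
        total += sum(col) / max(col) - 1.0
    return total
```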

The number of floating-point operations (FLOPs) required by
the respective techniques in the above experiment was also
examined. The number of FLOPs for one step proved to be
nearly the same for the present invention and literature 4.
Therefore, it is understood that the total number of FLOPs to
convergence is smaller for the technique of the present
invention. On the other hand, comparing literature 3 and the
present invention, the number of FLOPs required for one step
is smaller for the technique of literature 3; however, the
simulation showed that the total number of FLOPs required to
reach convergence is reduced to less than 2/3 by the present
invention. When using the algorithms shown in Fig. 3 and
Fig. 4, nearly equal results can be obtained. Moreover, a
similar result can be obtained when the condition number of
the mixing matrix A is not as unfavorable as in this
simulation.

Fig. 8 through Fig. 10 are diagrams illustrating the
results of the separation of image signals by the present
invention as an example of the separation of real signals.
Figs. 8(a) to 8(c) depict image signals composed of three
original images, wherein Fig. 8(a) uses a part of the girl
image, to which a gray transformation was applied, from the
standard image database SIDBA of the University of Tokyo,
while Fig. 8(b) uses a part of the standard high precision
picture data SHIPP of the Institute of Image Electronics
Engineers of Japan, to which the gray transformation was
applied. Figs. 9(a) to 9(c) depict mixed images where the
original images shown in Figs. 8(a) to 8(c) were mixed by
using a mixing matrix generated from uniform random
variables. Figs. 10(a) to 10(c) depict the results of
separation according to the present invention.

The result of separation shown in Fig. 10(a) shows that,
according to the separation scheme of the present invention,
the image signals of the original image shown in Fig. 8(a)
are extracted. Furthermore, Figs. 10(b) and 10(c) show that
the image signals of the original images shown in Figs. 8(b)
and 8(c) are separated stably.

Note that there are cases where the plus and minus signs of
the separation result are reversed relative to the signals
of the original images. However, in the case of images, a
phase deviation of \pi (i.e., a negative-positive reversal)
is obvious to the eye, so that it can be handled by
reversing the sign of the separated signals. As for the
amplitude, a user may adjust it appropriately. In the images
shown in Figs. 10(a) to 10(c), the minimum and maximum
luminance values of the obtained images were adjusted to a
displayable range. Of course, no information about the
signals of the original images is necessary for these
manipulations.
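
The sign reversal and luminance adjustment described above can be
sketched as a simple linear rescaling. The function name, the
flip_sign flag and the 0 to 255 display range are illustrative
assumptions.

```python
def rescale_for_display(pixels, flip_sign=False, lo=0, hi=255):
    """Map separated values linearly onto [lo, hi], optionally negating first."""
    vals = [-v for v in pixels] if flip_sign else list(pixels)
    vmin, vmax = min(vals), max(vals)
    if vmax == vmin:
        return [lo for _ in vals]  # constant image: nothing to stretch
    scale = (hi - lo) / (vmax - vmin)
    return [round(lo + (v - vmin) * scale) for v in vals]
```

Only the separated values themselves are used, in line with the
statement that no information about the original images is needed.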
Next, applications (1) to (4) of the signal separation
method of the present invention will be described.
(1) Restoration of a medical image for extraction of
necessary information
Although medical images come in many varieties, consider
the aforementioned fMRI (functional magnetic resonance
imaging) as an example. fMRI is used to observe how the
spatial distribution of brain activity changes with time
when some kind of stimulus is given or when an indicated
action is performed. Since fMRI observes this change based
on the oxygen level in the blood, it is necessary to restore
and extract information from weak signals. In order to
accurately associate the action under observation with an
active brain region, extraneous signal changes due to the
heartbeat or movement of the head should be removed. The
conventional approach of accumulating the observed images
over each observation period and removing noise depending on
whether the time-series correlation at each pixel position
is large or small has the problem that the time resolution
deteriorates. Therefore, assuming that the change of brain
activity due to an action and the change of brain activity
due to other factors are independent, a more accurate
extraction of the change of brain activity is expected
compared with the conventional method, by using a method
like the present invention, which separates and extracts the
original signals adaptively and relatively fast at each
time even if the interference process is unknown.
(2) Enhancement and restoration of blurred images
By applying the present invention, the enhancement and
restoration of blurred images is performed by regarding each
differential term of the Taylor expansion of the blurred
image as an independent component and regarding the
coefficients of the Taylor expansion as the elements of the
mixing matrix. According to the present invention, the
original image prior to blurring can be extracted from a
smaller number of observed images than was previously
required, or with a smaller amount of calculation than the
prior art.

(3) Noise reduction by image reconstruction using
feature extracting components of observed images
interfered by noise

After the observed images interfered by noise are decomposed
into independent components by independent component
analysis, noise reduction is performed by removing the
components that are regarded as noise and synthesizing the
image using the inverse matrix of the separation
matrix. As the method of the present invention is
tolerant to noise, more accurate independent components
than in the prior art can be extracted, thereby allowing more
efficient noise reduction.
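
The reconstruction step described in (3) can be sketched for two
observed channels. The sketch assumes a linear convention y = W x for
the separated components, which may differ from the notation used
elsewhere in this specification, and the function name and the
noise_index argument are hypothetical.

```python
def denoise_2ch(x, W, noise_index):
    """Suppress one separated component and re-synthesize the observations.

    x           : two observed channels, each a list of floats
    W           : 2x2 separation matrix, separated components y = W x
    noise_index : index (0 or 1) of the component regarded as noise
    """
    # Separate: y = W x
    y = [[W[i][0] * u + W[i][1] * v for u, v in zip(x[0], x[1])]
         for i in range(2)]
    # Remove the component regarded as noise.
    y[noise_index] = [0.0] * len(y[noise_index])
    # Synthesize with the inverse of the separation matrix.
    det = W[0][0] * W[1][1] - W[0][1] * W[1][0]
    if det == 0.0:
        raise ValueError("separation matrix is singular")
    W_inv = [[W[1][1] / det, -W[0][1] / det],
             [-W[1][0] / det, W[0][0] / det]]
    return [[W_inv[i][0] * u + W_inv[i][1] * v for u, v in zip(y[0], y[1])]
            for i in range(2)]
```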

(4) Classification and recognition of face data and
multidimensional data using feature extracting components of
observed multidimensional signals such as an image

By performing independent component analysis of the observed
signals (i.e., signal separation) and classifying the data
by the obtained components, classification with higher
precision is expected compared with a conventional analysis
that considers only up to second-order statistics, since the
data analysis is performed using information that takes
higher-order statistics into account. Also, the method of the
present invention is tolerant of signal distortion and of
perturbations due to slight differences in the observation
direction, thereby allowing robust data classification.
Therefore, the present invention is applicable to face
recognition or identification in biometrics, or to the
identification of images that are obtained by observation
and measurement by satellites (e.g., identifying forest
land, residential land, or sea, etc.).

As has been described in detail, according to the present
invention, it becomes possible to stably separate an
original signal from multiple multidimensional signals with
faster convergence than the previous scheme, even when
multiple multidimensional signals have been observed in a
mixed state. The separation scheme according to the present
invention is also useful for moving pictures as well as for
image data containing color data such as RGB. Though moving
pictures are comprehended as three-dimensional images (i.e.,
X axis, Y axis and time axis), a great effect can be
achieved when applying the present invention, since the time
required for convergence can be greatly reduced.

[Advantages of the Invention]

In this way, according to the present invention, it is
possible to stably extract an original signal from multiple
signals in a small number of calculation steps when multiple
signals including multidimensional signals have been
observed in a mixed state.

3. Brief Description of the Drawings 

Fig. 1 depicts a conceptual diagram showing signal
separation problems.

Fig. 2 depicts a block diagram showing a first algorithm for
estimating a separation matrix that minimizes a cost
function.

Fig. 3 depicts an overall configuration of a second
algorithm.

Fig. 4 depicts an estimation filtering of w_i(t) in the second
algorithm.

Fig. 5 depicts a flow of processing from data reading to
data outputting according to the present invention.

Fig. 6 depicts a conceptual diagram showing a degree of
consideration of an error in the cost function.

Fig. 7 depicts the results of convergence when conducting
the independent trial ten times and taking an average of
them in the experiment of separation.

Figs. 8(a) to 8(c) depict image signals composed of three
original images.

Figs. 9(a) to 9(c) depict mixed images where the original
images shown in Figs. 8(a) to 8(c) were mixed by using a
mixing matrix, in which uniform random variables were
generated.

Figs. 10(a) to 10(c) depict the results of separation
according to the present invention.

[Description of the Symbols]